Attorney Docket No.: 500.36683CX1 
Serial No.: 10/025,743 

IN THE SPECIFICATION: 

Please replace the paragraph beginning on page 1, line 9 with the following 
rewritten paragraph: 



-Fig. 13 shows an arrangement of a general information processing 
system as a prior art. A processor 1 and a memory controller 2 are connected 
by a system bus 110, the memory controller 2 and a memory 3 are connected 
by a memory bus 111, and the memory controller 2 and another system are 
connected by an IO bus (not shown). The processor 1 of the present system 
includes an on-chip cache (which will be referred to as the L1 cache, 
hereinafter) 12, and an L2 cache 14 connected to the system bus 110. The 
memory controller 2 performs connection control not only over the memory 3 
and L2 cache 14 but also over the other system. The operation of the 
processor 1 of reading an instruction code (which operation will be referred to 
as fetch, hereinafter) is summarized as follows. The processor 1 issues a 
memory access request to the memory controller 2 via the instruction 
processing part 11 and system bus 110. The memory controller 2, in response 
to the request, reads an instruction code from the L2 cache 14 or memory 3 
and transmits it to the processor 1 . An access size between the processor 1 
and memory 3 is influenced by the L1 cache 12 so that the reading of the code 
from the memory 3 is carried out on every line size basis as the management 
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unit of the L1 cache 12. Most processors are typically equipped with an L2 

cache provided outside the processor core as a relatively high-speed memory 

in addition to an L1 cache. The word 'cache 1 as used herein refers to a 

memory which stores therein an instruction code once accessed by a memory 

to realize a high-speed access to the same code in the case of an occurrence 

of the re-access to the same code. In order to perform arithmetic operation, the 

processor also makes access not only to such an instruction code but also to 

various sorts of data including operands and to external registers. Even these 

data is stored in an cache in some cases. Such a technique is already 

implemented in many systems including a personal computer as a typical 

example. - 



Please replace the paragraph beginning on page 2, line 23 with the following 
rewritten paragraph: 



-In an information processing system, in addition to the arithmetic 
operation performance of a processor, the reading performance of an 
instruction code from a memory to the processor is also important. A delay 
from the access request of the processor to the acceptance of the data thereof 
is known as access latency. In these years, the core performance of the 
processor has been remarkably improved, but an improvement in the supply 
capability of the instruction code from the access memory is still insufficient. 
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When the access latency becomes unnegligible due to a performance 

difference between the both, the operation of the processor stalls, which 

disadvantageously results in that the processor cannot fully exhibit the 

performances and thus the memory system becomes a bottleneck in the 

system. Such access latency problems occur not only for the instruction fetch 

but also for data or register operands. -- 



Please replace the paragraph beginning on page 3, line 14 with the following 

rewritten paragr aph: 

-Conventional methods for improving access latency include first to fourth 
methods as will be described. ~ 



Please replace the paragraph beginning on page 3, line 25 with the following 
rewritten paragraph: 



-The second improvement method is to speed up the memory. In order to 
speed up the memory, one may speed up the operation of the memory per se and 
also use a cache as the memory. However, such a high-speed memory as a high- 
speed SRAM or a processor-exclusive memory is expensive, which undesirably 
involves an increase in the cost of the entire system. Meanwhile the cache has 
problems based on its principle as follows. That is, the cache is effective after once 
accessed and is highly useful when repetitively accessed. In particular, a program 
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to be executed on a so-called embedded processor tends to have a low locality of 
references, the re-use frequency of an instruction code is low and thus the cache 
memory cannot work effectively. This causes the instruction code to have to be read 
out directly from the memory, for which reason this method cannot make the most of 
the high-speed feature of the cache. Further, such a high-speed cache memory 
used as a high-speed SRAM or a processor-exclusive memory is expensive. 
Though the price/performance ratio of the memory is improved, the employment of 
the latest high-speed memory involves high costs. An increasingly large capacity of 
memory has been demanded by the system in these years. Thus the cost increase 
becomes a serious problem.-- 



Please replace the paragraph beginning on page 9, line 27 with the following 
rewritten paragraph: 

-Fig. 1 is a general block diagram of an embodiment of the present invention. 
The present embodiment is an example wherein a memory 3 stores therein an 
instruction code to be executed on a processor 1 and data such as operands to 
perform a prefetching operation for an instruction code access. 

Please replace the paragraph beginning on page 1 1 , line 8 with the following 
rewritten paragraph: 



-The memory controller 2 will be described in detail below.- 
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Please replace the paragraph beginning on page 1 1 , line 23 with the following 

rewritten paragraph: 

-The control circuit 5 performs control over the entire memory controller. The 

control circuit 5 also performs read-ahead control from the instruction code memory 

32, in addition to control over the switch circuits 6, 9, memory control circuits 21, 22, 

system bus control circuit 20, etc. Details of the control circuit 5 will be explained in 

connection with Figs. 4, 8 and 9.-- 



Please replace the paragraph beginning on page 13, line 25 with the following 
rewritten paragraph: 

-Explanation will now be made as to an implementation example of the 
access judgement circuit. Fig. 2 is a block diagram of an example of the access 
judgement circuit 4 in the memory controller 2 of Fig. 1 in the present invention. The 
access judgement circuit 4 has a prefetch hit judgement circuit 41 and an instruction 
fetch detection circuit 42. The prefetch hit judgement circuit 41 has a prefetch 
address register 41 1 for storing therein the address of the prefetched instruction 
code and a comparator 412 for comparing the address accessed by the processor 
with the address prefetched by the memory controller. When both addresses 
coincide with each other, the prefetch hit judgement circuit 41 judges it as a prefetch 
hit. The instruction fetch detection circuit 42 has an instruction-code memory area 
address register 421 for storing therein an upper address indicative of the instruction 
code memory area and a comparator 422 for comparing the upper address of the 
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instruction-code memory area address register 421 with the address accessed by 
the processor.-- 



Please replace the paragraph beginning on page 14, line 18 with the following 
rewritten para graph: 

-Although not illustrated in FIG. 2, the access judgement circuit further 
includes an access read/write judgement circuit. When a coincidence is found in the 
comparison and the access is of a read type, the judgement circuit can determine it 
as an instruction code fetch. For example, in the case where the instruction code 
memory area is from 100 00000H to 10FF FFFFH, 10H as upper 8 bits of the upper 
address is previously set in the instruction-code memory area address register 421, 
an access to the instruction code area can be detected from the comparison result 
of the upper 8 bits of the address accessed by the processor. The setting of the 
instruction-code memory area address register 421 is required only once at the time 
of the initialization setting.- 



Please replace the paragraph beginning on page 15, line 5 with the following 
rewritten paragr aph: 

-As described above, the present embodiment provides that detection of the 
instruction code fetch is carried out by judging whether or not the access address of 
the processor is placed in the instruction code memory area, the detection of the 
fetch access of the instruction code and the prefetch hit judgement are carried out at 
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the same time, whereby access judging operation can be realized with a small 

overhead time.— 



Please replace the paragraph beginning on page 15, line 27 with the following 
rewritten paragraph: 

-An explanation will now be provided as to a control circuit for performing 
read-ahead control, transfer control over the processor, and control over the entire 
memory controller. Fig. 4 is a block diagram of an example of the control circuit 5 in 
the memory controller in the present invention of Fig. 1 . The control circuit 5 
includes a prefetch address generation circuit 51, a prefetch sequencer 52 and a 
selector 53.— 



Please replace the paragraph beginning on page 16, line 20 with the following 
rewritten paragraph: 

-The subject matter of this method is to calculate an address to be accessed 
next thus not to restrict the access size to the line size of the level-i cache. Further, 
the line size value 51 1 may be a fixed value or a variable value by a register. The 
prefetch sequencer 52, on the basis of information received from the system bus 
control line or access judgement circuit 4, executes a memory access and a prefetch 
from the memory according to the access of the processor.-- 
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Please replace the paragraph beginning on page 17, line 6 with the following 
rewritten paragraph: 

--Fig 5 is a block diagram of an example of the buffer memory 8 in the 
memory controller of the present invention. In some processors, it is impossible to 
read addresses sequentially from its smaller address in a burst read access of level- 
1 cache filling operation. This is because the most critical instruction code is read 
ahead. For example, when it is desired to read 32-bit data having continuous 
addresses 0, 1, 2 and 3; the data may not be read in the address ascending order of 
0, 1 , 2 and 3 but may be read in an address order of 2, 3, 0 and 1 . In order to solve 
such an access problem, in the present example, the buffer memory 8 includes a 
plurality of buffer memories having a width equal to the access size of the processor. 
More specifically, in the example, an instruction code is assumed to consist of 32 
bits, 4 channels of buffer memories 0 to 3 each having a 32-bit width are provided so 
that data are stored in the buffer memories sequentially from the buffer memory 0 at 
the time of reading from a memory, whereas, data transfer is carried out in an order 
requested by the processor in the processor transfer mode. As a result, the present 
invention can flexibly be compatible with any proces sor access system.- 

Please replace the paragraph beginning on page 18, line 2 with the following 
rewritten paragraph: 
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—Fig. 6 is a block diagram of another embodiment of the memory controller of 
the present invention. The present embodiment provides that the memory controller 
2 includes an instruction decoder circuit 43 for decoding and analyzing an instruction 
code transferred from the instruction code memory 32 to the memory controller 2 
and also includes a branching buffer memory 84. The instruction decoder circuit 43 
detects presence or absence of a branch instruction such as branch or jump in the 
transferred instruction code. The control circuit 5, when the instruction decoder 
circuit 43 detects a branch instruction, reads ahead an instruction code at the branch 
destination into the branching buffer memory 84. The access judgement circuit 4, in 
the presence of an instruction code access from the processor, judges whether or 
not it is found in the normal read-ahead buffer memory 8 or in the branching buffer 
memory 84. In the case of a hit, the control circuit 5 transfers the instruction code 
from the hit buffer memory to the processor. As a result, even when a branch takes 
place in the processor, performance deterioration caused by stall can be improved. - 



Please replace the paragraph beginning on page 18, line 25 with the following 

rewritten paragraph: 

-Fig. 7 is a block diagram of another embodiment of the memory controller of 
the present invention. The present embodiment provides that a buffer memory and 
a control circuit are includes not only for the instruction code area but also for the 
data memory area and register area, individually.— 
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Please replace the paragraph beginning on page 19, line 3 with the following 
rewritten paragraph : 

-An access from the processor is divided by the switch circuit 90 into 
accesses to the instruction code area, data area and register areas. The access 
judgment circuit 4 judges a hit in each buffer memory. The access judgement circuit 
4 can be easily implemented in substantially the same manner as in the embodiment 
of Figs. 3 and 4. The control circuit 5 has a data access control circuit 501 , an 
instruction code access circuit 502 and an I/O control circuit 503. Each control 
circuit has a sequencer for prefetch control to implement a prefetch for each area. 
Further, even switch circuits 61, 62, 63, direct paths 71 , 72, 73 and buffer memories 
81, 82, 83 are provided for each area.-- 

Please replace the paragraph beginning on page 20, line 8 with the following 
rewritten paragraph: 

-Next, an explanation will be provided of the operation of the prefetch 
sequencer 52 by referring to Figs. 8 and 9. Fig. 8 shows a flowchart of an 
exemplary operation of the prefetch sequencer 52 in Fig. 4. This exemplary 
flowchart shows when data corresponding to one access size is prefetched from an 
address following the current access for preparation of the next access at the time of 
occurrence of the access to the instruction code area.— 
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Please replace the paragraph beginning on page 20, line 17 with the following 

rewritten paragraph: 



-When a processor access takes place, the prefetch sequencer 52 first 
judges whether or not this access is a read access to the instruction code area (step 
201). The judgement is implemented, e.g., by means of address comparison, and 
its comparison circuit is implemented with the access judgement circuit 4. In the 
case of the read access to the instruction code area, the sequencer judges whether 
or not a prefetch hit occurs (step 202). Even for this judgement, a judgement result 
of the access judgement circuit 4 is used. In the case of a hit, the sequencer starts 
data transfer from the buffer within the memory controller to the processor (step 
203). In the case of no hit, the sequencer performs the data transfer from the 
memory to the processor via the direct path (step 204). Further, since the data 
within the prefetch buffer is not a prefetch hit data, the prefetch buffer is cleared 
(step 205).- 

Please replace the paragraph beginning on page 22, line 13 with the following 
rewritten paragraph: 

-This embodiment provides that continuous instruction codes estimated to be 
accessed next are fetched until the buffer becomes full of the codes to reach its full 
storage capacity (buffer full). In this conjunction, it is desirable to set the buffer 
capacity to be an integer multiple of the access size. As a result, since the transfer 
between the memory and the buffer memory of the memory controller can be carried 
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^\ out with a relatively long burst size at a time, the need for performing the read-ahead 
operation for each instruction code access from the processor can be eliminated and 
control can be facilitated.-- 



Please replace the paragraph beginning on page 22, line 25 with the following 
rewritten paragraph: 

-Fig. 10 is a timing chart showing an exemplary memory access of the 
present invention. In this example, the prefetch effect at the time of the memory 
access will be explained by comparing it with that of the prior art. It is assumed 
herein as an example that the processor reads an instruction code through two burst 
read accesses for each cache line size on the basis of continuous addresses of from 
0000 to 001 F. Four words of '0000' in the first access and 4 words of '0010' are 
burst-read respectively in 4 cycles.-- 

Please replace the paragraph beginning on page 28, line 4 with the following 
rewritten paragraph: 

-The read-ahead to the memory controller is carried out at the time of a 
processor access so that, at the time of a read-ahead hit, the data of the buffer 
memory is transferred to the processor and at the same time, an address to be 
accessed next by the processor is estimated to perform the read-ahead operation 
from the buffer to the buffer memory. At the time of a read-ahead error, the data is 
transferred from the memory directly to the processor and at the same time, the data 

13 



Attorney Docket No.: 500.36683CX1 
Serial No.: 10/025,743 

of the buffer memory is cleared, an address to be accessed next by the processor is 

estimated to perform the read-ahead operation from the memory to the buffer 

memory. As a result, at the time of a read-ahead error, the read-ahead access can 

be realized simultaneously with the access to the processor, whereby the system 

can cope with continuous access requests from the processor.- 
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